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Abstract Linking relations described in text with relations in 
visualizations is often difficult. We used eye tracking to mea- 
sure the optimal way to extract such relations in graphs, col- 
lege students, and young children (6- and 8-year-olds). 
Participants compared relational statements (“Are there more 
blueberries than oranges?”) with simple graphs, and two sys- 
tematic patterns emerged: eye movements that followed the 
verbal order of the question (inspecting the “blueberry” value 
first) versus those that followed a left- first bias (regardless of 
the left value’s identity). Question-order patterns led substan- 
tially to faster responses and increased in prevalence with age, 
whereas the left-first pattern led to far slower responses and 
was the dominant strategy for younger children. We argue that 
the optimal way to verify a verbally expressed relation’s con- 
sistency with visualization is for the eyes to mimic the verbal 
ordering but that this strategy requires executive control and 
coordination with language. 

Keywords Eye movements and visual attention • Relations • 
Visualizations 
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Relational comparison is fundamental to higher-level cogni- 
tion and consequently has received substantial attention in 
research (Halford, Wilson, & Philips, 2010; Hummel & 
Holyoak, 2005; Gentner, 1983). Many relational judgments 
require comparison of multiple types of relational representa- 
tions, including visuals and text (Ainsworth, 2006; Bergey, 
Cromley & Newcomb, 2015). Assembling furniture requires 
integrating a diagram with verbal instructions, and solving a 
mathematical word problem can require matching verbal re- 
lations to a drawing. More generally, a fundamental part of 
scientific reasoning is the ability to compare evidence with 
theories (Kuhn & Pearsall, 2000), and the former is typically 
expressed through visualizations and the latter through verbal 
descriptions. Although all of these comparisons require people 
to compare particular visual relations to particular 
verbalizations, there is strikingly little work on how our visual 
system accomplishes this task. We explored the case study 
comparing relations in visually presented data in graphs to 
verbally described relations. 

Extracting visual relations is surprisingly complex. Take 
the minimally simple bar graph in Fig. 1. At first glance, it 
contains a single relation between two objects. Closer inspec- 
tion reveals far more. The two bars contain three dimensions 
that could be important: magnitudes (small and large), colors 
(blue and orange), and spatial locations (left and right). 
Among these six values, there are a surprising number of 
potential relations that a graph reader might extract 
(Table 1). A viewer could focus on magnitude relations based 
on color (e.g., the blue bar is larger), color relations based on 
magnitudes (e.g., the larger bar is blue), or magnitude relations 
based on spatial locations (e.g., the left bar is smaller). Table 1 
shows the 6 total variations in these descriptions, which dou- 
bles to 12 when each of these descriptions is inverted (e.g., the 
last example could instead state that the right bar is larger). 
The number doubles again to 24 when each is rephrased as a 


Published online: 13 May 2016 


® Springer 


Author's personal copy 


a 


Are there more 
blueberries than 
oranges? 



b 


Spatial Location 
Order Routine 




0 


Identity (Color) or 
Magnitude Order 
Routine 


I 


Fig. 1 (a) Example trial sequence. Participants first viewed a question 
about a magnitude relation in an upcoming graph, followed by a brief 
delay and a two-bar graph. The colors of the bars in the graph represented 
different types of fruits (e.g., orange = oranges, blue = blueberries), (b) 
Schematics showing three possible visual routines during graph viewing: 
spatial location (i.e., first saccade towards left bar; top), identity or 
magnitude (i.e., first saccade toward bar associated with the relational 
target of the question or toward the taller bar; bottom). Gray dots represent 
eye fixations 


question, such as “which color is farther to the right?” Our 
graph only has two bars. It is no wonder that younger students 
have such difficulty dealing with graphs (Friel et af, 2001) 
and why advanced students still struggle with more complex 
graphs. Even experienced graph readers find graph compre- 
hension difficult, taking time and effort equivalent to reading a 
paragraph (Carpenter & Shah, 1998). 

Addressing the challenges of comparing visual and verbal 
relations requires that we understand how the expert viewer is 
able to choose between alternative interpretations of a visual 
relation. We argue that the many different ways of interpreting 
relations in a graph are not simply different ways of thinking 
about patterns in the graph but different ways of seeing those 
patterns. Specifically, we argue that the visual routine used by 
a viewer (Ullman, 1984) — the sequence of a graph’s values 
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that they attend to over time — constmcts the relational expres- 
sions that they see. We use eye tracking to determine the 
routines that most efficiently extract a given relational descrip- 
tion. We report the results of a set of experiments that reveal 
the presence and importance of these routines and their devel- 
opment across three age groups: 6-year-olds, 8-year-olds, and 
college students. 

Seeing different descriptions of relations within a graph is 
likely a serial visual routine, because its more basic subpro- 
cess — perceiving categorical visual relations — seems to be a 
serial process. Finding a pair of objects in a particular relation 
(e.g., a plus above a minus sign) within a display of oppositely 
arranged pairs is tediously slow, perhaps requiring inspection 
of single pairs at a time (Logan 1994, 1995; Wolfe, 2001). 
Recent work suggests an even tighter capacity limit for rela- 
tional processing, suggesting that we must inspect each indi- 
vidual object within a pair to overcome the visual system’s 
severe limitations in matching objects with their properties 
(Franconeri et al., 2012; Luck & Ford, 1998; see Hummel & 
Biederman, 1992, for a more optimistic proposal). 
Electrophysiological data confirms this prediction: when peo- 
ple judge relations (do you see a green object to the left of a 
red object, or vice-versa?), they systematically shift their at- 
tention to isolate one of the objects, suggesting that moving 
the eyes left may produce the relational description “green on 
the left,” whereas moving to the right may produce “red on the 
right” (Franconeri et al., 2012). 

Past work on scene perception is consistent with a close 
link between the focus of attention and relational descriptions 
(Griffin & Bock, 2000; Papafragou et al., 2008). Cueing at- 
tention to a given object can change the way that people de- 
scribe its relation within a scene. In a scene depicting a dog 
and a man running, people were more likely to describe it as 
“The dog chases the man” when the dog was subliminally 
cued and as ’’The man flees the dog” when the man was cued 
(Gleitman et al., 2007; Roth & Franconeri, 2012). In another 
study, people were faster to verify that an image contained "a 
star above a plus” compared with "a plus below a star” despite 


Table 1 Twelve descriptions of possible relations present in the graph in Fig. 1. The descriptions are based on all possible combinations of the three 
dimensions (color, magnitude, location) of the two bars. Framing each as a question doubles the number of potential candidates 

Dimension 



Color 

Magnitude 

Location 

Descriptor 

Orange 


The orange bar is smaller 

The orange bar is on the left 

Blue 

- 

The blue bar is larger 

The blue bar is on the right 

Smaller 

The smaller bar is orange 

- 

The smaller bar is on the left 

Larger 

The larger bar is blue 

- 

The larger bar is on the left 

Left 

The left bar is orange 

The left bar is smaller 

- 

Right 

The right bar is blue 

The right bar is larger 

- 
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the logical equivalence of these two statements (Clark & 
Chase, 1972). This preference for framing a relation as 
“above” seemed to be due to an attentional bias for the top 
object (which is common in other tasks, see Xu & Franconeri, 
2015); the effect weakened under instructions to attend to the 
bottom. 

These results are all consistent with the idea that the way 
that a viewer isolates objects with attention over time can con- 
trol the relational description that they extract from a scene. For 
the graph in Fig. 1, when asked, “Are there more blueberries 
than oranges?” — what’s the optimal way to move attention to 
answer the question? There are three salient types of routines: 

• Magnitude : Inspect the larger bar to verify whether it is 
blue. If this is the routine used, we would predict that the 
viewer should seek the larger bar, because seeking affir- 
mations of a sentence is more efficient than seeking nega- 
tions (Clark & Chase, 1 972), and the sentence relies on the 
word “more” as opposed to “fewer.” 

• Sentence ordering of object identity (color): Inspect the 
blue bar and verify that it is larger. We would predict that 
in this case the viewer should seek the blue bar, because 
comparing the first-mentioned target (“blueberries”) of 
the visually depicted relation is more efficient than com- 
paring the reference (“oranges”) object (Roth & 
Franconeri, 2012, for review). 

• Spatial Location : Inspect the left bar and verify that it is 
smaller. We would predict that expert graph viewers 
should not use this strategy, because the x-axis is categor- 
ical, not ordinal, for graphs in the current study; thus, the 
spatial locations of the bars should be irrelevant (although 
they might be relevant for graphs where the x-axis indi- 
cated time or another ordered dimension). Despite its in- 
efficiency, some viewers might use this strategy with a 
leftward bias, based on the pervasiveness of the left bias 
in scene-viewing and English reading order (Dickinson & 
Intraub, 2009; Tversky et al., 1991). The left bias even 
appears to extend to graph reading. In past work using a 
task similar to ours, participants were fastest to verify that 
a graph contained a certain trend, e.g., “A is greater than 
B,” when the left-to-right order of the bars matched the 
order of the sentence (Feeney & Webber, 2003). 

We tested for the presence and effectiveness of these three 
routines using displays similar to those in Fig. 1 . We used eye 
tracking as a proxy for the locus of visual attention, measuring 
the direction of the first saccade to either bar, and we used 
response time to simple questions about magnitude relations, 
e.g., “Are there more blueberries than oranges?” as a measure 
of the effectiveness of the routine. We predicted that efficient 
graph reading would reflect the magnitude or identity 
routines and that the spatial location routine should lead 
to poor performance. 


Experiment 1 
Method 

Participants We collected data from 17 adults (14 females, 
mean age = 23 years) with normal or corrected-to-normal 
vision. Participants were compensated $10. Data from one 
participant was discarded, because eye movements occurred 
on only 25 % of trials. Sample sizes were determined a priori 
to be consistent with participant numbers used in previous 
sentence-picture verification (Clark & Chase, 1972) and eye- 
tracking studies (Papfragou et al., 2008). 

Stimuli and apparatus Stimuli were presented using SR- 
Research Experiment Builder on a ViewSonic E70fB CRT 
monitor (1,024 x 768 pixels, 85-Hz refresh rate). The display 
subtended 23.8° x 17.2° at approximately 81 cm away from 
the monitor. Eye movements were recorded by an EyeLink 
1000 Tower-Mounted eye tracker (1000-Hz sampling rate). 
The bars in the graph represented types of fruits (blueberries, 
limes, and oranges). The bars were the same width (1.5°) but 
differed in height; the shorter bar length was always 1 .9°, and 
the taller bar length was always 5.5°. The bars were separated 
by 3.5° and appeared on unlabeled x- and y-axes. 

Procedure The experiment began with a 9-point grid calibra- 
tion sequence in which participants tracked a moving dot. 
Figure 1 shows a typical trial sequence. Participants first 
viewed a question about a magnitude relation (e.g., “Are there 
more blueberries than oranges?”). Questions were phrased 
equally often as more and fewer relations, and each fruit was 
assigned equally often as the relational target and reference 
object. Participants pressed a key when they were ready to 
view the graph. A fixation dot appeared at the center, and once 
participants pressed the spacebar and their eyes were posi- 
tioned within 2° of fixation, a graph appeared. The depicted 
relation matched the question on half of the trials. The bar 
associated with the target appeared equally on the left and 
right sides. Participants were instructed to judge the relative 
heights of the bars (i.e., the taller bar represented a larger 
quantity). Participants responded “yes” or “no” by pressing 
one of two keys labeled “Y” and “N” as quickly but as accu- 
rately as possible. Based on the combination of fruit type, 
question quantifier, taller bar location, and correct response, 
there were 24 trials total divided into two blocks of 12 trials. 
Participants completed a second calibration sequence between 
the two blocks. 

Results 

Accuracy Only trials in which saccades occurred (86 % of all 
trials) were included in all behavioral analyses. Mean accura- 
cy on the task was 91.7 %. 
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Eye movements We only analyzed first saccades, excluding 
saccades that did not originate at central fixation (i.e., the start 
position was outside of a 1.7° window around central fixation) 
or that did not end on the display screen, which totaled 6 % of 
all first saccades. We examined saccades during graph view- 
ing to test for biases. Saccade end positions were coded rela- 
tive to central fixation (0°) in a binary fashion based on four 
bar properties: location (left = negative, right = positive), mag- 
nitude (short = negative, tall = positive), relational category 
(reference = negative, target = positive), and magnitude con- 
gruency, i.e., whether the bar’s magnitude was incongruent 
(negative) or congruent (positive) with the question’s quanti- 
fier (more/fewer). We tested for biases in the distribution of 
first saccades by comparing the average saccade size to 0°. 

Average saccade biases are shown in Fig. 2a and b. 
Participants systematically looked first toward the relational 
target bar (M sentence-order bias = 0.83°), 95 % confidence 
interval (Cl) [0.57° 1.08°], *(15) = 6.36, p < 0.001, Cohen’s d 
= 1 .59 (Fig. 2b). There were no systematic biases based on bar 
magnitude or magnitude congruency (both *’ s < 0.5) and only 
a mild tendency to look toward the left bar first (M spatial bias 
= -0.26°), 95 % Cl [-0.6° 0.08°], *(15) = 1.49,/? = 0.157, 
Cohen’s d = 0.37 (Fig. 2a). Thus, college-age participants 
tended to explore the graph using a visual routine that mim- 
icked the color order of the question, regardless of whether the 
target bar appeared on the left or right side of the graph. 

Individual differences in visual routines To explore the be- 
havioral consequences of visual routines, we conducted two 
correlation analyses; one tested for an association between 
participants’ average spatial bias and average response time 
(RT) and one tested for an association between average 
sentence-order bias and average RT (Fig. 2a and b). We ex- 
cluded trials for which the RT was greater than 2 standard 
deviations above the mean RT for each participant and all 
incorrect trials; in total, 13 % of the dataset was excluded. 
There was a weak correlation between spatial bias and RT, r 
= -0.24, *(14) = 0.93, p > 0.25, whereas the correlation be- 
tween sentence-order bias and RT was stronger, r = -0.49, 
*(14) = 2.12, p = 0.05, such that larger biases toward the 
sentence target bar were associated with faster responses. 
However, sentence-order biases were not significantly more 
predictive of average RTs than the spatial biases, Fisher’s z = 
0.84,/? > 0.25. Thus, although only the identity routine was 
related to participants’ response efficiency, it was not a statis- 
tically stronger predictor of responses than the spatial location 
routine. 

Discussion 

College-age viewers primarily extracted graph relations by 
inspecting bars using the sentence ordering of the bars’ iden- 
tity (color). Furthermore, individuals who used this identity 


routine more systematically responded more quickly. The 
identity routine requires more attentional control, because 
the syntax of the question dictates which bar participants 
should attend to first and the viewer must inhibit a default 
routine to explore from left to right. Thus, individuals who 
have underdeveloped attentional control, such as young chil- 
dren (Diamond, Kirkham & Amso, 2002), may be less likely 
to use the identity routine to explore the graph. We repeated 
Experiment 1 with 6- and 8-year-olds to test whether children 
would be more likely to use default spatial location routines 
and whether larger spatial biases would be associated with 
slower responses. 

Experiment 2 
Method 

Participants We collected data from thirteen 8-year-olds (10 
females, mean age =102 months) and seventeen 6-year-olds 
(11 females, mean age = 78 months) who were recmited from 
the greater Chicago area by contacting their parents via email, 
direct mail, and pamphlets from the lab. All participants had 
normal or corrected-to-normal vision. Each child received a 
$10 bookstore gift card as compensation. Data from four 6- 
year-olds were discarded; three children did not follow task 
instructions (one child performed at chance, and two children 
responded which fruit there was more of instead of responding 
“yes” or “no”), and one child did not maintain central fixation 
before viewing the graphs. 

Stimuli and apparatus Participants were tested in a different 
laboratory room than adults to better accommodate the chil- 
dren. Stimuli were presented using SR-Research Experiment 
Builder on a ViewSonic E70fB CRT monitor (1,024 x 
768 pixels, 85-Hz refresh rate), with the display subtending 
27° x 19.8° at a distance of approximately 70 cm from the 
monitor. Eye movements were recorded by an EyeLink 1000 
Desktop Mount eye tracker (1000-Hz sampling rate). The 
stimuli were identical to those in Experiment 1 . 

Procedure The procedures were identical to Experiment 1 
with a few exceptions. The experimenter verified that partic- 
ipants understood the color-fruit mappings (e.g., blue bar = 
blueberries) and the magnitude mappings (e.g., taller bar = 
greater quantity). We modified the calibration so that it was 
framed as a game (see Supplementary Materials for details). 
This procedure was run before and after each block. The ex- 
perimenter read each question out loud to ensure that partici- 
pants understood the question. Once participants communicat- 
ed that they were ready, the experimenter initiated presenta- 
tion of a central fixation image, which consisted of several 
colored concentric circles (diameter = 1.7°). Participants were 


<£) Springer 


Author's personal copy 


Psychon Bull Rev 


Spatial Routines 




Left Bar Saccade Size o Right Bar 



Fig. 2 Eye movement results for Experiments 1 and 2. Dashed outlined 
bars represent graph stimuli at approximate locations of graph bars in the 
display and are not drawn to scale. The x-axes depict average saccade 
biases from 0 (initial center fixation) towards graph bars according to two 
potential eye movement patterns, (a), (c), and (e) present first saccades 
according to their horizontal spatial bias: first saccades directed left of 
center fixation plotted to the left versus those directed to the right of 

instructed to fixate this image as best they could. The graph 
appeared either once the child fixated the central image within 
1° of center or once 2 seconds elapsed, whichever occurred 
first. After viewing the graph, the child responded to the ques- 
tion by saying “yes” or “no.” Verbal responses were recorded 
and timed with a stopwatch by a second experimenter who 
was unaware of the predicted outcomes of the experiment. To 
calculate RTs, the start time began once the graph appeared 
and ended once the child finished speaking. 

Results 

Accuracy Only trials in which eye movements occurred were 
included in all behavioral analyses. Mean accuracy was 
96.7 % for 8-year-olds and 84.4 % for 6-year-olds. We con- 
ducted a one-way ANOVA on accuracy data with age group as 
a between-group factor (including adults, whose mean accu- 
racy was 91.7 %). There was a significant effect of age group, 
F(2,40) = 4.96, p = 0.012, r| p 2 = 0.20; post-hoc tests revealed 


Identity Routines 



Reference Bar saccade size o Target Bar 




fixation plotted to the right, (b), (d) and (f) present first saccades toward 
the relational “reference” bar plotted to the left versus those directed to 
the relational “target” bar plotted to the right. Each point shows the 
average saccade bias and average response time (y-axis) for a single 
participant. The vertical dotted line shows the average saccade size bias, 
collapsed across participants. Error bars are based on 95 % confidence 
interval 


that 8-year-olds performed significantly better than 6-year- 
olds, £(25) = 2.93, p = 0.007, Cohen’s d= 1.12, and marginally 
better than adults, £(27) = 1.89, p = 0.070, Cohen’s d = 0.7, 
who trended towards higher performance than 6-year-olds, 
£(28) = 1.65, p = 0.111, Cohen’s d= 0.6. 

Eye movements We only included trials that contained unam- 
biguously measurable saccades (i.e., the saccade start position 
occurred within a 1.7° window around the central fixation 
image), which occurred on 97 % of trials for 8 -year-olds and 
on 79 % of trials for 6-year-olds. 

We calculated the average size of first saccades based on 
various target properties in the same way as Experiment 1 to 
test for overall biases. In contrast to adults, 8-year-olds exhib- 
ited a significant left-bar- first bias (M spatial bias = -0.91°), 
95 % Cl [-1.25° -0.56°], £(12) = 5.07 ,p < 0.001, Cohen’s d = 
1.40 (Fig. 2c). Eight-year-olds also showed a significant 
target-bar- first bias (M sentence-order bias = 0.36°), 95 % Cl 
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[0.12° 0.61°], *(12) = 2.89, p = 0.014, Cohen’s d = 0.80 
(Fig. 2d). No other biases were observed (all *’ s < 1.3). 

Similar to 8-year-olds, 6-year-olds showed a significant 
left-bar-first bias (M spatial bias = -0.86°), 95 % Cl [-1.21° 
0.50°], *(13) = 4.15, p < 0.001, Cohen’s d= 1.27 (Fig. 2e) and 
a trend for a target-bar-first bias (M sentence-order bias = 
0.14°), 95 % Cl [-0.05° 0.33°], *(13) = 1.46,/? = 0.168, 
Cohen’s d = 0.55 (Fig. 2f). No other biases were observed 
(all *’s < 1). 

To test for developmental differences in overall biases, we 
conducted one-way ANOVAs on spatial and sentence-order 
biases with age group as a between-subjects factor (including 
data from Experiment 1). For spatial biases, there was a main 
effect of age group, F( 2,40) = 4.24, p = 0.021, r| p 2 = 0.17; 
adults had significantly weaker leftward biases than both 6- 
year-olds, *(28) = 2.36, p = 0.026, Cohen’s d = 0.86, and 8- 
year-olds, *(27) = 2.56, p = 0.016, Cohen’s d = 0.96, but 
spatial biases did not differ among 6- and 8-year-olds, *(25) 
= 0.21, p >0.25. Additionally, there was a main effect of age 
group on sentence-order biases, F(2,40) = 9.05,/? < 0.001, r| p 2 
= 0.31, such that adults had significantly stronger sentence- 
order biases than both 6-year-olds, *(28) = 4.14, p < 0.001, 
Cohen’s d = 1.51, and 8-year-olds, *(27) = 2.54, p = 0.017, 
Cohen’s d = 0.95. Eight-year-olds had marginally stronger 
sentence-order biases than 6-year-olds, *(25) = 1.41, p = 
0.171, Cohen’s d= 0.54. 

Thus, viewing strategies for extracting graph relations 
changed over the course of development. On average, both 
6- and 8 -year-olds relied on the spatial location routine (at- 
tending to the left bar first), whereas adults showed a weaker 
left-bar- first bias. Additionally, both adults and 8 -year-olds 
used the identity routine at rates above chance, but 6-year- 
olds did not. However, adults inspected the target bar first 
more systematically than 8-year-olds. Critically, these find- 
ings suggest that default visual routines diminish with age 
and are replaced with better routines that respect a task’s 
context. 


bias for 6-year-olds, Fisher’s z = 1.97,/? = 0.049. Thus, the 
more systematically children attended to the left bar first, the 
slower their response times were overall. In contrast, children 
who more successfully used the identity routine tended to be 
faster overall; however, only 8-year-olds showed this pattern. 

Effect of target bar location We hypothesized that systemat- 
ic left-bar- first routines would lead to slower responses partic- 
ularly when the target bar appeared on the right side of the 
graph, because the bar colors would be encoded in the oppo- 
site order as the question. In support of this idea, we generally 
observed stronger correlations between sentence-order bias 
and average RT for displays in which the target bar appeared 
on the right than the left side (for details, see Figure S2 in the 
Supplemental Materials). However, the distribution of spatial 
biases differed between adults and children for target-right 
displays; whereas most adults tended to inspect the right bar 
first, most children showed strong left-bar-first biases. 

To further probe how the target bar location affected RTs, 
we conducted a more fine-grained analysis of individual trials. 
For each age group, we divided RTs across participants into 
two categories based on speed of response: fastest (lowest 
third percentile) and slowest RTs (highest third percentile). 
We measured the distribution of fastest and slowest trials for 
which first saccades were launched toward the left versus right 
bar. Importantly, we performed this analysis separately for 
displays in which the target bar appeared on the left versus 
right. 

Figure 3 plots first saccades color-coded by RT category 
(green = fastest, red = slowest) for each age group and display 



Individual differences in visual routines We correlated av- 
erage RTs with spatial and sentence-order biases for 6- and 8- 
year-olds (Fig. 2c-f). We excluded RTs that were greater than 
2 standard deviations above the mean RT for each participant 
and incorrect trials (totaling 12.7 % of all data). Spatial biases 
were marginally correlated with RTs for 8-year-olds, r = 
-0.51, *(11) = 1.99,/? = 0.072 (Fig. 2c) and significantly cor- 
related with RTs for 6-year-olds, r = -0.71, *(12) = 3.45,/? = 
0.005 (Fig. 2e). Eight-year-olds also showed a robust correla- 
tion between sentence-order bias and RT, r = -0.68, *(11) = 
3.10, p = 0.010 (Fig. 2d), whereas 6-year-olds did not, r = 
-0.29, *(12) = 1.05,/? > 0.25 (Fig. 2f). Although spatial and 
sentence-order bias correlations did not differ statistically for 
8-year-olds, Fisher’s z = 0.57, p > 0.25, spatial bias was a 
significantly stronger predictor of RTs than sentence-order 



Fig. 3 Visualizations of first saccades by display type (target bar on the 
left or right side). Saccades are color coded by RT speed (red = slowest 
third of all RTs, green = fastest third of all RTs in each age group). Dots on 
the end of the lines represent the end points of the saccades 
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type. For target-right displays, children had more slow than 
fast RTs due to left-bar-first saccades. In contrast, adults had 
more fast than slow RTs when they inspected the right bar 
first. We verified these qualitative patterns with quantitative 
evidence by comparing the average number of fastest and 
slowest responses for saccades launched towards the left ver- 
sus right bar as a function of target bar location (see Figure S3 
in the Supplemental Materials available online). Thus, chil- 
dren incurred an RT cost, because they encoded the graph in 
the opposite order as the question, whereas adults showed an 
RT benefit, because they encoded the graph in the same order 
as the question. However, these RT effects only occurred 
when the target bar appeared on the right, suggesting that 
attentional control was more critical for these displays. 

Although children showed stronger left-first biases than 
adults, it is possible that children intended to use the identity 
routine but were involuntary drawn toward the left side when 
the target bar appeared on the right side. If this were the case, 
then children may have spent a shorter amount of time fixating 
the left side before looking towards the right when the target 
bar appeared on the right side of the graph. To test this possi- 
bility, we conducted an ANOVA on the duration of leftward 
fixations with age as a between-group factor (6-, 8-year-olds, 
and adults) and target bar location (left or right) as a within- 
subjects factor. Only trials in which gaze shifted from the left 
to the right side of the graph were included, and if multiple 
consecutive leftward saccades occurred before the rightwards 
shift, fixation duration was summed across consecutive left- 
ward saccades. 

Based on these criteria, five adult participants were exclud- 
ed from the analysis, because they never shifted their gaze 
from left to right in either or both target bar location condi- 
tions. There was a main effect of target location, such that 
leftward fixations were shorter when the target bar appeared 
on the right (M duration =532 ms) than the left (M duration = 
640 ms), F( 1, 35) = 6.60, p = 0.01 5, T | p 2 = 0.16. There also was 
an interaction between age group and target location, F( 2,35) 
= 3.77,/? = 0.033, T| p 2 = 0.1 8. Post-hoc tests revealed that only 
8-year-olds had significantly shorter leftward fixations when 
the target bar appeared on the right (M = 474 ms) versus left 
(M = 718 ms), *(12) = 3.98, p = 0.002, Cohen’s d = 1.35, 
although 6-year-olds showed a trend for shorter leftward fix- 
ations for target-right (M = 526 ms) than target-left displays 
(M= 621 ms), *(13) = 1.56,/? = 0.142, Cohen’s d = 0.51. In 
contrast, for adults, leftward fixation durations were statisti- 
cally similar for target-right (M = 606 ms) and target-left dis- 
plays (M= 564 ms), *(10) = 0.43,/? > 0.25; however, because 
adults rarely attended to the left bar first for target-right dis- 
plays, it is possible that we did not observe any differences in 
left bar gaze duration because of low power. Because children 
were more likely to fixate the left bar for a shorter duration 
when the target appeared on the right than left, it is possible 
that children involuntarily shifted attention towards the left bar 


and quickly reoriented attention towards the right bar. 
Additionally, 8 -year-olds showed a stronger reorienting effect 
than 6-year-olds, suggesting that the ability to compensate for 
default routines also may improve with age. 

General discussion 

Relational perception is a severely capacity limited process. 
Yet, dozens of relational descriptions can be pulled from even 
the simplest displays (Taylor & Tversky, 1992). We show that 
effectively extracting the correct relation for a task relies on 
the way in which a viewer moves their attention through the 
values. We asked college students and elementary school- 
aged children to verify that a graph agreed with a simple 
statement, e.g., “Are there more blueberries than oranges?” 
and found that moving attention in the “correct” way — 
shifting one’s eyes to the relation’s target — predicted fast re- 
sponse times. The “incorrect” strategy — shifting one’s eyes 
to the left bar of the graph, regardless of the question asked — 
led to slow response times; this pattern was especially preva- 
lent among 6-year-olds. Strategy choice had larger behavioral 
consequences when the target bar was on the right side; 
whereas children incurred an RT cost by attending to the left 
(mismatched) bar first, adults experienced an RT benefit by 
attending to the right (matched) bar first. Children also were 
more likely to compensate for initial leftward saccades by 
quickly shifting to the right bar for target-right displays; how- 
ever, 8-year-olds showed stronger reorienting effects than 6- 
year-olds. Together, we show that it is easier to coordinate 
visual relations with verbal information when the two repre- 
sentations are encoded in the same serial order. However, this 
temporal alignment requires attentional control, which is lim- 
ited in younger children. 

Our findings have several implications for the practice of 
integrating relations from multiple representations. For in- 
stance, coordinating relations in text with visual relations 
may be easier if people read text first so that they can attend 
strategically to visualizations (Bergey, Cromley & 
Newcombe, 2015; Bacharach, Carr & Mehner, 1976). 
Efficient extraction of relations may be especially critical for 
processing more complex displays or for generating infer- 
ences about a visualization. Additionally, educators may ben- 
efit from knowing the potentially negative consequences of 
using default left-first routines when teaching young children 
how to interpret visualizations. Child participants may have 
relied more on automatic spatial routines, because children 
often scan from left to right when learning to write and read 
text, charts, and tables in English-speaking cultures (Tversky, 
Kugelmass & Winter, 1991). It therefore is possible that pre- 
literate children might exhibit weaker spatial biases when 
viewing graphs; on the other hand, preliterate English- 
speaking children still demonstrate leftward biases in tasks 
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other than reading and writing, such as object counting (for 
review, see Nuerk et al, 2015). Thus, further research is neces- 
sary to determine whether children younger than age 6 would 
show left-first biases when viewing graphs. Another open ques- 
tion is how strongly children would rely on vertical spatial rou- 
tines, such as attending to a top bar first if the bars extended 
horizontally (a “row graph” as opposed to a “column graph”). 
Previous studies have established an attentional bias for the top 
in adults (Clark & Chase, 1972; Xu & Franconeri, 2015). 
Children also might show an upwards bias, because many 
school-related materials, such as text and charts, begin at the 
top (Tversky, 2011). Because there is a strong mapping between 
vertical space and quantity (Clark & Chase, 1972; Tversky, 
2011), it also is possible that children might rely on the quantifier 
of the question to guide eye movements for vertically arranged 
bar graphs (e.g., showing a top bias for more questions and a 
bottom bias for fewer questions). Although it may be possible to 
teach children to guide their attention in a more efficient manner 
during graph comprehension, another solution is to design dis- 
plays containing text and visualizations for younger children 
such that a spatial routine can be effective in comparing the 
relations expressed in each format. 
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